SNR-dependent background noise compensation of PESQ values for cellular phone speech

نویسندگان

Kengo Fujita

Tsuneo Kato

Hideaki Yamada

Hisashi Kawai

چکیده

To evaluate the speech quality of actual cellular phone systems with an objective assessment, PESQ values were compared with MOS values for speech with background noises via four cellular phone systems used in Japan. As PESQ value errors were observed to be SNR-dependent, two SNR-dependent background noise compensation methods for PESQ values are proposed. Applying the compensation methods to the speech for four cellular phone systems, the RMSEs between MOS and compensated PESQ values were reduced to less than half of the original RMSEs for all four cellular phone systems. They were equal to the level of RMSE of MOS values.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...

متن کامل

Blind source extraction based on a direction-dependent a-priori SNR

In many hands-free applications, we encounter a speaker located in the near-field embedded in diffuse far-field noise. In this paper, we contribute an algorithm to estimate the speech and noise power spectral density (PSD) based on a directiondependent SNR (DD-SNR). The only prior knowledge needed is a model of the diffuse noise sound field. The enhanced speech signal is obtained by a parametri...

متن کامل

Distributed multichannel speech enhancement based on perceptually-motivated Bayesian estimators of the spectral amplitude

In this study, the authors propose multichannel weighted Euclidean (WE) and weighted cosh (WCOSH) cost function estimators for speech enhancement in the distributed microphone scenario. The goal of the work is to illustrate the advantages of utilising additional microphones and modified cost functions for improving signal-to-noise ratio (SNR) and segmental SNR (SSNR) along with log-likelihood r...

متن کامل

Feature Compensation for Speech Recognition in Severely Adverse Environments Due to Background Noise and Channel Distortion

This paper proposes an effective feature compensation scheme to address severely adverse environments for robust speech recognition, where background noise and channel distortion are simultaneously involved. An iterative channel estimation method is integrated into the framework of our Parallel Combined Gaussian Mixture Model (PCGMM) based feature compensation algorithm [1]. A new speech corpus...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

SNR-dependent background noise compensation of PESQ values for cellular phone speech

نویسندگان

چکیده

منابع مشابه

Speech enhancement based on hidden Markov model using sparse code shrinkage

Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Blind source extraction based on a direction-dependent a-priori SNR

Distributed multichannel speech enhancement based on perceptually-motivated Bayesian estimators of the spectral amplitude

Feature Compensation for Speech Recognition in Severely Adverse Environments Due to Background Noise and Channel Distortion

عنوان ژورنال:

اشتراک گذاری